CDS

Accession Number TCMCG036C07033
gbkey CDS
Protein Id PTQ43690.1
Location complement(join(242013..242366,242499..242641,242888..243113,243278..243703,244043..244384,244639..246228))
GeneID Phytozome:Mapoly0023s0026
Organism Marchantia polymorpha
locus_tag MARPO_0023s0026

Protein

Length 1026aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772695.1
Definition hypothetical protein MARPO_0023s0026 [Marchantia polymorpha]
Locus_tag MARPO_0023s0026

EGGNOG-MAPPER Annotation

COG_category T
Description domain-containing protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K19330        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04140        [VIEW IN KEGG]
map04140        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005794        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0012505        [VIEW IN EMBL-EBI]
GO:0030154        [VIEW IN EMBL-EBI]
GO:0032502        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0045445        [VIEW IN EMBL-EBI]
GO:0048856        [VIEW IN EMBL-EBI]
GO:0048869        [VIEW IN EMBL-EBI]
GO:0061061        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGCTGGACGAAGAGTTTGAGAGTCCAGAGAAACTTAATCGAGCTTTGGGAGTCGATGTGGAGAGCAGCAGATGTGAAGGTTTCGTTGGATCCAGTCCATCACCCTACATTGCGGAAGGATCTGCCGATTGTTGCCCGGAATGGGAAAACAACGCCAAAGTCACCGCTACAGCGGTCAATTCCGACACCGGACAGGGGCACTATGGGATATCCTATACTGCTCGCCCAATCAATATTGACGCAAGTGGGGAGGCAAATGTTCAAGTTGAGGGAAACGAGAAGGATCAAGTGTTTACAGAAGATATCGATGATCAAACACTTCCTGATAGTCTCGAACGAGGGTCCGCTGTCATGACAGAAGACCATGACAGGGTAGTGAGTGTTGAAGAGAATTGTGACCAAGATATTCTGGAAGAAAAGGATTTGGAAGCAACTAGTAGCATATCGGAATACAATTCACTTACTGGCAGAGATTCCGATGATGATGACAGTTCTATGTTTGGTGCGAGTCCATCCAAATACTTCCAGGCATTGAGTAAAACTAATAGCACTTGCGACCAGTTTTTCAACTCCGAAGACGAATTAGCGTTTGGGTGCAGTGATTGGAGCGAGTACGTTCATCAGATTGGGGGCGAGTTGGGTTCTTCTTTTGTCGAGCTTCAAGAGGCGAGGAAGGATTTACATCCAGTGCTTGGAGTAGGACCTCCGTTGGATAGGTTTCTCGATGACTCGGAGGACAAAAGTATTGATGCTGGTGAAGATGGAGATAGGTCAGACGGTGATCTGCCACCGGAGGAGGTTTCTCGAGTAAGGGGCGCCGGTAGACTTGATCTTGACGACGGTGATGAAGATGTCACTGCCGCGGACGGAGAAAGTTCTCTTACTGGTCCTGCGTCTTTTTCCACAACTTTACCGGCCTTTCACTCAAATGAGGGATGTGGGATAGCTTTGTTCGGAGAGGATGAAAATACGATTGACTACGACGATTTTTGGGACCTTACGTCAGGGTCTCACCTAACGTCCTTGAGAGACGCAACTGACGACTCCTTAGGAGTTCATGGAGAGGTCGAGGAGCACATTATTGAAGAGCTTAAGATTTTACCTTCCCAGGAGGACTTTAGTGCAGAAGCACATGGATTGCCTAATAACCTTAGAGAGTCAGGCGACTTTAGTTTGCTTGTTCAAGAAAGTGTGGAACCAATGTCGGAAGCGGGGAATAAGATAGACTTGCAAGTTCGAACTCTAGACGAGAATTCCGTATCTGGAATAGCTTGTCATACAGGACTCAAACTCAAAGTGGATGTTAATCGTACAAGGGAAACGCCTAAATTGCAACAAGTGTCAGCAGAGAGTCATCTCCTGGAAGATGTCGCTCCAGGAATTTCTGTCGGAGATCACTGCAAAAACGGGAGTCGCATCGTAGATCTTGGCCTGAAGGCAGCTGACATCATCCAGGACCAGGTGCATTGCACATCTCTATCACCGGGGAAAGTCGTAGGCTCGGCGCCGTCAACACCAGGAAAATCCCTTGAAGATGAGAGTAAGCGAGCAGGTGTGGATTGGGCCGAAGAGGATCTGGGGCCTGACGAGCTCTCAAGATATGTAGAGGCACTGGAAGTCAAGGAGACCTACATAGACACTGTGCTGGATATGGAAGACGTCCTTTTCGACGGGGAGGGTGGAGGTGGCAGTCGAACTTTTCAAGTTGGAAAGGTCACGTCACCCGGTTTCGTTCGTCCAGTTCGAGATGGCAGCCTGAGCGCTTCCACATCAAGTGTTCTGGCATTGGCCTCGAGACAGATCTCTCATCCCACGCGCCTGCTGAGCGACGTAGACTGGGTCGAAGTGGTGGGTGCTGTACAACGTCATGGCGGTGCTTCTCTGGGAGAGCGAGTCGTTGGCGTCAAGCAACACACGGTATATCGTATAAAGGACCAATCGGGATCAACGTTGCCCCTGCAGGAGTTTGGACTTGACTCACCAAAATCCGACATCTCGAGTAATGCACCTGATCTTGAGGATGACGGAACCCAGTCTTCAGTTTTAGGGAAAACCATAAGATTGATTGTACAGATCCACAAGAAAAAGCCTTTGCGACAGCAGCTTCAGGCACAACATTATACCTGTGCTGGTTGCTACAAGCGTCTGGAGCTGGCCCTGGGAATTGTCCCAGAACTTGTACAAAATTGGGGTTGGAGGGGACCAAGACTTTGTGAATACACCGGTCAATTGTTCTGTTCTACTTGTCACTTGAATGAAACGGCTGTCTTGCCAGCATGGGTTCTGCAGCGGTGGGATTTTACTCCGCGCCTTGTTTCGCAACTTGCAAAAGCTTACTTGGACTCAATATATGATAAGCCGATGTTGTGTGTTAGTGCCGTCAATCCATATTTGTATGCAAGGGTACCAGTTCTCGCACATTTGACAGAGATGAGAAGAAAAATTAACAAGATGCTGGCTTGCATCCGCTGTCCGGCCCGCACAAGAATTCAAACGATGCTCGGCTCCCGTCGGTACCTCTTGGAGAACAATGACTTTTGCGCCCTTCGTGATCTTGCGGACTTGTCCAAGGGTGCGTTTGCAGTCTTGCCTGGATACATGCGAGCCGTCCTTTTGAAGCTGTCGTCCCACATAACCAGAGAGTGTTTTCTTTGTCGAGAGCTTGGAGAGCCTTGTGGTGCCGGGGAGTTGTGCTACGACGAGTACGACGTCATTTACCCACACCAGGACGAACTCATCGTTAGATGTCCCTCATGTCAGCATCCTTTCCATAAAAGGTGCTACGCCAAATGCCAGAAATGCCCTTCCTGCAGAGGACAACCGGAGCTGAAACGAAATGATTCTTTGCTCACCGTTCAGCAACATGGTGAGCATGCGGGTGAATTTAGTGCAAGTAAAGCTTCTCCGGAACCGCTGAAAAGGACAGAGTCTCTGACTTCACCTGCTAATCCCAGAGAGAATAAGTCATCAACAAGAAGAAGTCTTTTCGCAAATTTTCTTGGTTCAAGAGAAGCTCGAAGTCCGGAGCAGAAGAAAGAGATAATAAATATGAATCCCCTTTCTAGTCCTATAGAATTGTAA
Protein:  
MLDEEFESPEKLNRALGVDVESSRCEGFVGSSPSPYIAEGSADCCPEWENNAKVTATAVNSDTGQGHYGISYTARPINIDASGEANVQVEGNEKDQVFTEDIDDQTLPDSLERGSAVMTEDHDRVVSVEENCDQDILEEKDLEATSSISEYNSLTGRDSDDDDSSMFGASPSKYFQALSKTNSTCDQFFNSEDELAFGCSDWSEYVHQIGGELGSSFVELQEARKDLHPVLGVGPPLDRFLDDSEDKSIDAGEDGDRSDGDLPPEEVSRVRGAGRLDLDDGDEDVTAADGESSLTGPASFSTTLPAFHSNEGCGIALFGEDENTIDYDDFWDLTSGSHLTSLRDATDDSLGVHGEVEEHIIEELKILPSQEDFSAEAHGLPNNLRESGDFSLLVQESVEPMSEAGNKIDLQVRTLDENSVSGIACHTGLKLKVDVNRTRETPKLQQVSAESHLLEDVAPGISVGDHCKNGSRIVDLGLKAADIIQDQVHCTSLSPGKVVGSAPSTPGKSLEDESKRAGVDWAEEDLGPDELSRYVEALEVKETYIDTVLDMEDVLFDGEGGGGSRTFQVGKVTSPGFVRPVRDGSLSASTSSVLALASRQISHPTRLLSDVDWVEVVGAVQRHGGASLGERVVGVKQHTVYRIKDQSGSTLPLQEFGLDSPKSDISSNAPDLEDDGTQSSVLGKTIRLIVQIHKKKPLRQQLQAQHYTCAGCYKRLELALGIVPELVQNWGWRGPRLCEYTGQLFCSTCHLNETAVLPAWVLQRWDFTPRLVSQLAKAYLDSIYDKPMLCVSAVNPYLYARVPVLAHLTEMRRKINKMLACIRCPARTRIQTMLGSRRYLLENNDFCALRDLADLSKGAFAVLPGYMRAVLLKLSSHITRECFLCRELGEPCGAGELCYDEYDVIYPHQDELIVRCPSCQHPFHKRCYAKCQKCPSCRGQPELKRNDSLLTVQQHGEHAGEFSASKASPEPLKRTESLTSPANPRENKSSTRRSLFANFLGSREARSPEQKKEIINMNPLSSPIEL